Adapting and evaluating a generic term extraction tool

نویسندگان

  • Anita Gojun
  • Ulrich Heid
  • Bernd Weißbach
  • Carola Loth
  • Insa Mingers
چکیده

On the basis of the term candidate extraction tools under development in the EU project TTC, we designed an application for German and English data that serves as a first evaluation of the approach and of the techniques for monolingual term candidate extraction used in the project. The application situation highlighted, among others, the need for tools to remove incomplete word sequences from multi-word term candidate lists, as well as the fact that the provision of German citation forms requires more morphological knowledge than TTC’s slim approach can provide. In the detailed evaluation of our extraction results, we profited from interaction with domain experts and from the fact that the same texts were used for both manual and automatic term extraction.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Refactoring of a Database

The technique of database refactoring is all about applying disciplined and controlled techniques to change an existing database schema. The problem is to successfully create a Database Refactoring Framework for databases. This paper concentrates on the feasibility of adapting this concept to work as a generic template. To retain the constraints regardless of the modifications to the metadata, ...

متن کامل

IPhraxtor - A linguistically informed system for extraction of term candidates

In this paper a method and a flexible tool for performing monolingual term extraction is presented, based on the use of syntactic analysis where information on parts-of-speech, syntactic functions and surface syntax tags can be utilised. The standard approaches to evaluating term extraction, namely by manual evaluation of the top n term candidates or by comparing to a gold standard consisting o...

متن کامل

Generic Tools, Specific Languages

Adapting tools to a particular domain is expensive, and the adaptation is often not very deep. To address this challenge, Generic Tools, Specific Languages shifts the focus from building and adapting tools (windows, buttons, algorithms) to building and adapting languages to a domain. The thesis applies the approach to embedded software development: mbeddr is an extensible set of integrated lang...

متن کامل

Evaluating the Effectiveness of Deductive and Inductive Form-Focused In-struction on Iranian EFL Learners' Implicit and Explicit Knowledge of Non-Generic Definite Article

This study investigated the relative effects of deductive and inductive form-focused instruction (FFI) on the acquisition of 4 non-generic definite article uses(cultural, situational, structural, and textual) as assessed by explicit and implicit outcome measures. The tests utilized to assess EFL learners' acquisition of definite article uses were timed and untimed grammaticality judgment tests....

متن کامل

Adapting a Generic Platform for Poetry Generation to Produce Spanish Poems

PoeTryMe was created as a generic system for the generation of poetry that takes into account both semantics, in the form of triplets of relations between concepts, and textual structure, in the form of a grammar of templates extracted from existing poems. It was originally instantiated to generate poetry in Portuguese. The present paper describes an effort to create a different instantiation o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012